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Abstract 

The problem of 1-bit compressive sampling is addressed in this paper. We introduce an optimization 
. model for reconstruction of sparse signals from 1-bit measurements. The model targets a solution 

that has the least £o-norm among all signals satisfying consistency constraints stemming from the 
I 1-bit measurements. An algorithm for solving the model is developed. Convergence analysis of the 

^ ■ algorithm is presented. Our approach is to obtain a sequence of optimization problems by successively 

D i approximating the fo-norm and to solve resulting problems by exploiting the proximity operator We 

. examine the performance of our proposed algorithm and compare it with the binary iterative hard 

I thresholding (BIHT) ifTOl a state-of-the-art algorithm for 1-bit compressive sampling reconstruction. 

Unlike the BIHT, our model and algorithm does not require a prior knowledge on the sparsity of the 
signal. This makes our proposed work a promising practical approach for signal acquisition. 
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I. Introduction 

Compressive sampling is a recent advance in signal acquisition BH, [[5l. It provides a method 
to reconstruct a sparse signal x G M" from linear measurements 

y = '^x, (1) 

where $ is a given m x n measurement matrix with m < n and y E is the measurement 
vector acquired. The objective of compressive sampling is to deliver an approximation to x from 
y and $. It has been demonstrated that the sparse signal x can be recovered exactly from y if ^ 
has Gaussian i.i.d. entries and satisfies the restricted isometry property [13. Moreover, this sparse 
signal can be identified as a vector that has the smallest ^Q-norm among all vectors yielding the 
same measurement vector y under the measurement matrix $. 

However, the success of the reconstruction of this sparse signal is based on the assumption that 
the measurements have infinite bit precision. In realistic settings, the measurements are never 
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exact and must be discretized prior to further signal analysis. In practice, these measurements 
are quantized, a mapping from a continuous real value to a discrete value over some finite range. 
As usual, quantization inevitably introduces errors in measurements. The problem of estimating 
a sparse signal from a set of quantized measurements has been addressed in recent literature. 
Surprisedly, it has been demonstrated theoretically and numerically that 1-bit per measurement 
is enough to retain information for sparse signal reconstruction. As pointed out in Q, [UOll . 
quantization to 1-bit measurements is appealing in practical applications. First, 1-bit quantizers 
are extremely inexpensive hardware devices that test values above or below zeros, enabling 
simple, efficient, and fast quantization. Second, 1-bit quantizers are robust to a number of non- 
linear distortions applied to measurements. Third, 1-bit quantizers do not suffer from dynamic 
range issues. Due to these attractive properties of 1-bit quantizers, in this paper we will develop 
efficient algorithms for reconstruction of sparse signals from 1-bit measurements. 

The 1-bit compressive sampling framework originally introduced in ^ is briefly described as 
follows. Formally, it can be written as 

y = A{x) := sign($x), (2) 

where the function sign(-) denotes the sign of the variable, element-wise, and zero values are 
assigned to be +1. Thus, the measurement operator A, called a 1-bit scalar quantizer, is a 
mapping from to the Boolean cube { — 1, 1}™. Note that the scale of the signal has been lost 
during the quantization process. We search for a sparse signal x* in the unit ball of M™ such 
that the sparse signal x* is consistent with our knowledge about the signal and measurement 
process, i.e., A{x*) = A{x). 

The problem of reconstructing a sparse signal from its 1-bit measurements is generally non- 
convex, and therefore it is a challenge to develop an algorithm that can find a desired solution. 
Nevertheless, since this problem was introduced in flS] in 2008, there are several algorithms that 
have been developed for attacking it [[31, [[T2|. [fTTl . f[T9l . Among those existing 1-bit compressive 
sampling algorithms, the binary iterative hard thresholding (BIHT) ifTOll exhibits its superior 
performance in both reconstruction error and as well as consistency via numerical simulations 
over the algorithms in [|3l, [[T2]. When there are a lot of sign flips in the measurements, a 
method based on adaptive outlier pursuit for 1-bit compressive sampling was proposed in [|T9l . 
The algorithms in [[TOl . [|T9l require the sparsity of the desired signal to be given in advance. 
This requirement, however, is hardly satisfied in practice. By keeping only the sign of the 
measurements, the magnitude of the signal is lost. The models associated with the aforementioned 
algorithms seek sparse vectors x satisfying consistency constraints ^ in the unit sphere. As a 
result, these models are essentially non-convex and non-smooth. In ifTTl . a convex minimization 
problem is formulated for reconstruction of sparse signals from 1-bit measurements and is solved 
by linear programming. The details of the above algorithms will be briefly reviewed in the next 
section. 

In this paper, we introduce a new io minimization model over a convex set determined by 
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consistency constraints for 1-bit compressive sampling recovery and develop an algorithm for 
solving the proposed model. Our model does not require prior knowledge on the sparsity of the 
signal, therefore, is referred to as the blind 1-bit compressive sampling model. Our approach for 
dealing with our proposed model is to obtain a sequence of optimization problems by successively 
approximating the £o-norm and to solve resulting problems by exploiting the proximity operator 
lfT6l . Convergence analysis of our algorithm is presented. 

This paper is organized as follows. In Section HI] we review and comment current 1-bit 
compressive sampling models and then introduce our own model by assimilating advantages 
of existing models. Heuristics for solving the proposed model are discussed in Section Hill 
Convergence analysis of the algorithm for the model is studied in Section |lVl A numerical 
implementable algorithm for the model is presented in Section |Vl The performance of our 
algorithm is demonstrated and compared with the BIHT in Section |VIl We present our conclusion 
in Section |Vnl 

II. Models for One-Bit Compressive Sampling 

In this section, we begin with reviewing existing models for reconstruction of sparse sig- 
nals from 1-bit measurements. After analyzing these models, we propose our own model that 
assimilates the advantages of the existing ones. 

Using matrix notation, the 1-bit measurements in dD can be equivalently expressed as 



where Y := diag(|/) is an m x m diagonal matrix whose ith diagonal element is the ith entry 
of y. The expression Y^x > in ([3]) means that all entries of the vector F$x are no less than 
0. Hence, we can treat the 1-bit measurements as sign constraints that should be enforced in 
the construction of the signal x of interest. In what follows, equation (|3]) is referred to as sign 
constraint or consistency condition, interchangeably. 

The optimization model for reconstruction of a sparse signal from 1-bit measurements in tSj 

is 



where || ■ ||i and || ■ II2 denote the £i-norm and the £2-norm of a vector, respectively. In model (jU, 
the £i-norm objective function is used to favor sparse solutions, the sign constraint Y^x > is 
used to impose the consistency between the 1-bit measurements and the solution, the constraint 
II a; II 2 = 1 ensures a nontrivial solution lying on the unit £2 sphere. 
Instead of solving model (H)) directly, a relaxed version of model (SI) 



F$x > 0, 



(3) 



min ||a:||i s.t. Y^x > and ||x||2 = 1 



(4) 




(5) 



4 



was proposed in [|3l and solved by employing a variation of the fixed point continuation algorithm 
in BU. Here A is a regularization parameter and h is chosen to be the one-sided ii (or £2) function, 
defined at z G M as follows 

(6) 

1 0, otherwise. 

We remark that the one-sided £2 function was adopted in ^ due to its convexity and smoothness 
properties that are required by a fixed point continuation algorithm. 

In lfT2ll a restricted-step-shrinkage algorithm was proposed for solving model dU. This algo- 
rithm is similar in sprit to trust-region methods for nonconvex optimization on the unit sphere 
and has a provable convergence guarantees. 

Binary iterative hard thresholding (BIHT) algorithms were recently introduced for reconstruc- 
tion of sparse signals from 1-bit measurements in [fTOl . The BIHT algorithms are developed for 
solving the following constrained optimization model 

m 

min h{{Y^x)i) s.t. ||x||o < s and ||x||2 = 1, (7) 
1=1 

where h is defined by equation s is a positive integer, and the ^o-norm ||x||o counts the 
number of non-zero entries in x. Minimizing the objective function of model © enforces the 
consistency condition ([3]). The BIHT algorithms for model (17]) are a simple modification of 
the iterative thresholding algorithm proposed in [2]. It was shown numerically that the BIHT 
algorithms perform significantly better than the other aforementioned algorithms in [|3l, [|T2| in 
terms of both reconstruction error as well as consistency. Numerical experiments in [[TOl further 
show that the BIHT algorithm with h being the one-sided £1 function performs better in low noise 
scenarios while the BIHT algorithm with h being the one-sided £2 function perform better in 
high noise scenarios. Recently, a robust method for recovering signals from 1-bit measurements 
using adaptive outlier pursuit was proposed for the measurements having noise (i.e., sign flips) 
in im. 

The algorithms reviewed above for 1-bit compressive sampling are developed for optimization 
problems having convex objective functions and non-convex constraints. In ifTTl a convex opti- 
mization program for reconstruction of sparse signals from 1-bit measurements was introduced 
as follows: 

min||x||i s.t. Y^x>0 and ||$x||i=p, (8) 

where p is any fixed positive number. The first constraint Y^x > requires that a solution 
to model ^ should be consistent with the 1-bit measurements. If a vector x satisfies the first 
constraint, so is ax for all < a < 1. Hence, an algorithm for minimizing the £i-norm by only 
requiring consistency with the measurements will yield the solution x being zero. The second 
constraint = p is then used to prevent model dH) from returning a zero solution, thus, 

resolves the amplitude ambiguity. By taking the first constraint into consideration, we know that 
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= {y,^x), therefore, the second constraint becomes = p. This confirms that 

both objective function and constraints of model ([8]) are convex. It was further pointed out in 
[fTTl that model ([8]) can be cast as a linear program. As comparing model dS]) with model 
both the constraint ||x||2 = 1 in model (Hj) and the constraint ||$a;||i = p in model ([8]), the only 
difference between both models, enforce a non-trivial solution. However, as we have already 
seen, model dH]) with the constraint = p can be solved by a computationally tractable 

algorithm. 

Let us further comment on models (|7]l and ([8]). First, the sparsity constraint in model ^ 
is impractical since the sparsity of the underlying signal is unknown in general. Therefore, 
instead of imposing this sparse constraint, we consider to minimize an optimization model having 
the £o-norm as its objective function. Second, although model dH) can be tackled by efficient 
linear programming solvers and the solution of model (|8) preserves the effective sparsity of 
the underlying signal (see ^]), the solution is not necessarily sparse in general as shown in 
our numerical experiments (see Section IVI] ). Motivated by the aforementioned models and the 
associated algorithms, we plan in this paper to reconstruct sparse signals from 1-bit measurements 
via solving the following constrained optimization model 

min ||x||o s.t. Y^x>0 and ||$a;||i=p, (9) 

where p is again a arbitrary positive number. This model has the £o-norm as its objective function 
and inequality F$x > and equality = p as, its convex constraints. 

We remark that the actual value of p is not important as long as it is positive. More precisely, 
suppose that S and S'^ are two sets collecting all solutions of model ^ with p = 1 and 
p = p^ > 0, respectively. If x E S, that is, F$x > and = 1, then, by denoting 

X* := p'^x, it can be verified that ||x*||o = Ija^Ho^ Y^x'^ > 0, and ||$x*||i = p^. That indicates 
X* G S^. Therefore, we have that p'^S C 5^. Conversely, we can show that 5^ C p^S by 
reverting above steps. Hence, p'^S = S^. Without loss of generality, the positive number p is 
always assumed to be 1 in the rest part of the paper. 

To close this section, we compare model <^} and our proposed model ^ in the following 
result. 

Proposition 1: Let y E be the 1-bit measurements from an m x n measurement matrix 
$ via equation d2l) and let s be a positive integer. Assume that the vector x G M" is a solution 
to model Then model ^ has the unit vector -tAt- as its solution if ||x||o < s; otherwise, 

ll"^ II 2 

model dZ]) can not have a solution satisfying the consistency constraint if ||x||o > s. 

Proof: Since the vector x is a solution to model dll), then x satisfies the consistency 
constraint F$x > 0. Hence, it, together with definition of h in dS), implies that 

X 



=1 



We further note that 
model dU) if ||x||o < s. 



\\x\\2 



|x||o and 



1. Hence, the vector ,74- is a solution of 

IfI|2 
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On the other hand, if ||x||o > s then all solutions to model ([7]) do not satisfy the consistency 
constraint. Suppose this statement is false. That is, there exists a solution of model ©, say 



such that > 0, Wx^o < s, and ||x*||2 = 1 hold. Set := p|^- Then ||a;'^||o = Wx^o < s, 

Y^x^ > 0, and ||$a;*||i = 1. Since ||a;*||o < ll^^llo^ it tums out that x is not a solution of 
model (111). This contracts our assumption on the vector x. This completes the proof of the result. 

■ 

From Proposition [H we can see that the sparsity s for model ^ is critical. If s is set too large, 
a solution to model ([7]) may not be the sparsest solution satisfying the consistency constraint; if 
s is set too small, solutions to model (|7]) cannot satisfy the consistency constraint. In contrast, 
our model (|9]l does not require the sparsity constraint used in model dV]) and delivers the sparsest 
solution satisfying the consistency constraint. Therefore, these properties make our model more 
attractive for 1-bit compressive sampling than the BIHT. Since sparsity of the underlying signal 
is not specified in advance in model we refer it to as blind 1-bit compressive sampling 
model. 

III. An Algorithm for the Blind 1-Bit Compressive Sampling 

In this section, we will develop algorithms for the proposed model We first reformulate 
model ^ as an unconstrained optimization problem via the indicator function of a closed convex 
set in R™+^. It tums out that the objective function of this unconstrained optimization problem 
is the sum of the ^o-norm and the indicator function composing with a matrix associated with 
the 1-bit measurements. Instead of directly solving the unconstrained optimization problem we 
use some smooth concave functions to approximate the ^o-norm and then linearize the concave 
functions. The resulting model can be viewed as an optimization problem of minimizing a 
weighted ^i-norm over the closed convex set. The solution of this resulting model is served 
as a new point at which the concave functions will be linearized. This process is repeatedly 
performed until a certain stopping criteria is met. Several concrete examples for approximating 
the ^o-norm are provided at the end of this section. 

We begin with introducing our notation and recalling some background from convex analysis. 
For the rf-dimensional Euclidean space R.'^, the class of all lower semicontinuous convex functions 
/ : M"' ^ (-00, +oo] such that dom/ := {x G M"' : f{x) < +00} 7^ is denoted by ToiW^). 
The indicator function of a closed convex set C in M"' is defined, at u E M°', as 

0, if u e C; 
+00, otherwise. 

Clearly, i-c is in ro(M'^) for any closed nonempty convex set C. 

Next, we reformulate model dD as an unconstrained optimization problem. To this end, from 
the m X n matrix $ and the m-dimensional vector y in equation we define an (m + 1) x n 
matrix 

diag(?/) 



Lciu) :-- 



B :-- 



$ (10) 
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and a subset of 

C := {z : Zm+i = 1 and -Zj > 0, i = 1, 2, . . . , m}, (11) 

respectively. Then a vector x satisfies the two constraints of model © if and only if the vector 
Bx lies in the set C. Hence, model ^ can be rewritten as 

min{||x||o + ic(5x) : X G M"}. (12) 

Problem (fT2)) is known to be NP-complete due to the non-convexity of the £o-norm. Thus, there 
is a need for an algorithm that can pick the sparsest vector x satisfying the relation Bx G C. To 
attack this iQ-norm optimization problem, a common approach that appeared in recent literature 
is to approximate the ^o-norm by its computationally feasible approximations. In the context of 
compressed sensing, we review several popular choices for defining the ^Q-nomi as the limit of 
a sequence. More precisely, for a positive number e G (0, 1), we consider separable concave 
functions of the form 

n 

F,(x) := J]/,(|x,|), xGM", (13) 

i=l 

where : IR+ — R is strictly increasing, concave, and twice continuously differentiable such 
that 

lim FAx) = llxllo, for all x G M". (14) 

e-)-0+ 

Since the function is concave and smooth on IR+ := [0, oo), it can be majorized by a simple 
function formed by its first-order Taylor series expansion at a arbitrary point. Write T^ix, v) : = 
F^(v) + (VFe(|w|), |x| — \v\). Therefore, at any point t> G M" the following inequality holds 

F,{x) < J^,{x,v) (15) 

for all x G with ^ \v\. Here, for a vector u, we use \u\ to denote a vector such that 
each element of \u\ is the absolute value of the corresponding element of u. Clearly, when v is 
close enough to x, J-'e{x, v) the expression on the right-hand side of ([TST l provides a reasonable 
approximation to the one on its left-hand side. Therefore, it is considered as a computationally 
feasible approximation to the £o-norm of x. With such an approximation, a simplified problem 
is solved and its solution is used to formulate another simplified problem which is closer to the 
ideal problem (fT2l) . This process is then repeated until the solutions to the simplified problems 
become stationary or meet a termination criteria. This procedure is summarized in Algorithm [T] 
The terms F^dx^^^l) and {'VF^(\x^''^\),\x^'^^) appeared in the optimization problem in Al- 
gorithm \T\ can be ignored because they are irrelevant to the optimization problem. Hence the 
expression for x^'''^^'^ in Algorithm [T] can be simplified as 

e aigmin {{VF,{\x^''^\),\x\) + Lc{Bx) : a; G M"} . (16) 

Since is strictly concave and increasing on f'^ is positive on R_|_. Hence, (V F^{\x^''^\), = 
Y^7=i /e ( ki*^^ \ can be viewed as the weighted £i-norm of x having /^'( |a; | ) as its ith weight. 
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Algorithm 1 (Iterative scheme for model (fT2)) ) 



Initialization: choose e G (0, 1) and let G M" be an initial point. 
repeat(A; > 0) 

Step 1: Compute 

x^^+^^ G argmin { Ix^'^)]) + Lc{Bx) : x G M"} . 

until a given stopping criteria is met 

Thus, the objective function of the above optimization problem is convex. Details for finding a 
solution to the problem will be presented in the next section. 

In the rest of this section, we list several possible choices of the functions in (fT3l) including 
but not limited to the Mangasarian function in [|14| and the Log-Det function in 

The Mangasarian function is given as follows: 

n 

F,(x) = 5^(1-6-1^'!/^), (17) 

i=l 

where x G M". This function is used to approximate the ^o-norm to obtain minimum-support 
solutions (that is, solutions with as many components equal to zero as possible). The usefulness 
of the Mangasarian function was demonstrated in finding sparse solutions of underdetermined 
linear systems (see [[TT|). 

The Log-Det function is defined as 

fr^ log(l/e) 

where x G M". Notice that ||x||o is equal to the rank of the diagonal matrix diag(x). The function 
F^{x) is equal to (log(l/e))~'^ log(det(diag(x) + eJ)) + n, the logarithm of the determinant of the 
matrix diag(x) + el. Hence, it was named as the Log-Det heuristic and used for minimizing the 
rank of a positive semidefinite matrix over a convex set in [|8]. Constant terms can be ignored 
since they will not affect the solution of the optimization problem (fT6] ). Hence the Log-Det 
function in (fTSi) can be replaced by 

n 

F,(x) = ^log(|x,|+e). (19) 

i=l 

The function for the above three choices are plotted in Figure [Tlfor n = 1 and e being |, 
\, and ^. We can see that for a fixed e G (0, 1) the Mangasarian function is the one which 
is the most closest to the £o-norm. 

We point it out that the Mangasarian function is bounded by 1, therefore, is non-coercive 
while the Log-Det function is coercive. This makes difference in convergence analysis of the 
associated Algorithm [T] that will be presented in the next section. In what follows, the function 
Fe is the Mangasarian function or the Log-Det function. We specify it only when it is noted. 



9 




(a) Mangasarian (b) Log-Det 

Fig. 1. Plots of Fi, Fi, f j_, -Fj_ with n = 1 for (a) the Mangasarian function; (b) the Log-Det function. 

IV. Convergence Analysis 

In this section, we shall give convergence analysis for Algorithm [U We begin with presenting 
the following result. 

Theorem 2: Given e G (0, 1), G M", and the set C defined by ([11]), let the sequence 
{x^''^ : G N} be generated by Algorithm [H where N is the set of all natural numbers. Then 
the following three statements hold: 

(i) The sequence {F^(x^''^) : k E N} converges when is corresponding to the Mangasarian 
function (flTl) or the Log-Det function (fT9l ): 

(ii) The sequence {x^^^ : /c G N} is bounded when F^ is the Log-Det function; 

(iii) ^k=i II l^^^*'"''^'' I — |2;^'^^|||2 is convergent when the sequence {x'^'^^ : A; G N} is bounded. 
Proof: We first prove Item (i). The key step for proving it is to show that the sequence 

{F^{x^''^) : A; G N} is decreasing and bounded below. The boundedness of the sequence is due to 
the fact that -^^(0) < F^{x^''^). From Step 1 of Algorithm [T] or equation (fT6] ). one can immediately 
have that 

LciBx^''^'^) = 

and 

(VF.dx^'^)!), < (VF.dx^'^)!), \x^''^\). (20) 

By identifying x^''^ and respectively, as v and x in (fT5l) and using the inequality in (|20l) . 

we get < F^{x'^''^). Hence, the sequence {F^{x'^''^) : /c G N} is decreasing and bounded 

below. Item (i) follows immediately. 

When F^ is chosen as the Log-Det function, the coerciveness of F^ together with Item (i) 
implies that the sequence {x^'^^ : k E N} must be bounded, that is. Item (ii) holds. 

Finally, we prove Item (iii). Denote w^''^ := — From the second-order Taylor 

expansion of the function F^ at x^''^ we have that 

F,(a;('^+i)) = J-,(x('=+i\x('=)) + l-{w^''yV^F,{v)w^''\ (21) 
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where v is some point in the line segment linking the points jx^'^+^^l and \x^''^\ and V'^F^(v) is 
the Hessian matrix of at the point v. 

By (|20l) . the first term on the right-hand of equation (1211 is less than By equa- 

tion (fT9l ). V^Fe(f ) for V lying in the first octant of is a diagonal matrix and is equal to 
— ^diag(e~"^, e""?, . . . , e"^) or — diag((t'i + e)~^, {v2 + e)~'^, . . . (w„ + e)~^) which corresponds 
to being the Mangasarian or the Log-Det function. Hence, the matrix V^Fe(f) is negative 
definite. Since the sequence {x^''^ : /c G N} is bounded, there exists a constant p > such that 

Putting all above results together into (|2T|) . we have that 

Summing the above inequality from A; = 1 to +oo and using Item (i) we get the proof of Item 

(iii). ■ 

From Item (iii) of Theorem |2l we have || — Ix'^'^^lH^ — as — )■ oo. 

To further study properties of the sequence {x^''^ : k E N} generated by Algorithm \T\ the 
matrix is required to have the range space property (RSP) which is originally introduced in 
ll2n . With this property and motivated by the work in ^2T\ we prove that Algorithm [T] can yield 
a sparse solution for model (fT2] ). 

Prior to presenting the definition of the RSP, we introduce the notation to be used throughout 
the rest of this paper. Given a set S C {1,2,..., n}, the symbol 15*1 denotes the cardinality of 
S, and := {1, 2, . . . , n} \ 5 is the complement of S. Recall that for a vector u, by abuse 
of notation, we also use |m| to denote the vector whose elements are the absolute values of the 
corresponding elements of u. For a given matrix A having n columns, a vector u in M", and a 
set S C {1,2, . . . ,n}, we use the notation As to denote the submatrix extracted from A with 
column indices in S, and us the subvector extracted from u with component indices in S. 

Definition 3 (Range Space Property (RSP)): Let A be an m x n matrix. Its transpose A^ 
is said to satisfy the range space property (RSP) of order K with a constant p > if for all sets 
S C {1, . . . ,n} with \S\ > K and for all ^ in the range space of the following inequality 
holds 

Us4i<pUs\\i. 

We remark that if the transpose of an m x n matrix B has the RSP of order K with a constant 
p > 0, then for every non-empty set S C {1, . . . the transpose of the matrix Bs, denoted 
by Bg, has the RSP of order K with constant p as well. 

The next result shows that if the transpose of the matrix B in Algorithm [H possesses the RSP, 
then Algorithm [T] can lead to a sparse solution for model (fT2l . To this end, we define a mapping 
(J : M'^ — 7- M'^ such that the ith component of the vector a{u) is the ith largest component of \u\. 

Proposition 4: Let i? be the (m + 1) x n matrix be defined by (flOl) and let {x'^'^^ : k E N} 
be the sequence generated by Algorithm [H Assume that the matrix B^ has the RSP of order 
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K with p > satisfying [l + p)K < n. Suppose that the sequence {x^^^ : /c e N} is bounded. 
Then (cr(x*^*^''))„ the nth largest component of x*^*^-* converges to 0. 

Proof: Suppose this proposition is false. Then there exist a constant 7 > and a 
subsequence {x'^^^^ : j G N} such that {a{x^''^^))n > 27 > for all j E N. From Item (iii) 
of Theorem |2] we have that 

> 7 (22) 

for all sufficient large j. For simplicity, we set y^'^^^ := V F^{\x^''^^\). Hence, by inequality (|22|) 
and Fe, we know that 

|a;(fci)|>0 Ix^'^^+^^l > 0, and y^''^^ > (23) 

for all sufficient large j. In what follows, we assume that the integer j is large enough such that 
the above inequalities in (l23l) hold. 

Since the vector x^^^"^^'' is obtained through Step 1 of Algorithm [T] i.e., equation (fT6l) . then 
by Fermat's rule and the chain rule of subdifferential we have that 

= diag(|/(^-^))9|| ■ ||i(diag(?/(^^))x(^^+i)) + 
where b^''^+^'> e dic{Bx^''^+^'>). By ([23]), we get 

d\\ ■ ||i(diag(y('=^))x^'''"''^) = {sgn(x('=^+^))}, 
where sgn(-) denotes the sign of the variable element- wise. Thus 

y{kj) ^ l^e^J+l)!, 

where ^C^i+i) = ^^^(^'j+i) is in the range of B'^ . 

Let S be the set of indices corresponding to the K smallest components of |. Hence, 

i=l 

and 

n 

i=n-K+l 

Since i?^ has the RSP of order K with the constant p, we have that H'C^^^^^'' ||i < pII'C^^''^^'' l|i- 
Therefore, 

n—K n 
i=l i=n.-is:+l 

However, by the definition of cr, we have that 

n~K 

Y.iaiy^'^% > {n-K)iaiy^'^^))n-K+r 
1=1 
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and 

n 

i=n-K+l 

These inequalities together with the condition (1 + p)K < n lead to 

n—K n 

i=l i=n~K+l 

which contradicts to (l24l) . This completes the proof of the proposition. ■ 
From Proposition |4l we conclude that a sparse solution is guaranteed via Algorithm [T] if the 
transpose of B satisfies the RSP. Next, we answer how sparse this solution will be. To this end, 
we introduce some notation and develop a technical lemma. For a vector x E W^, we denote by 
r(x) the set of the indices of non-zero elements of x, i.e., t(x) := {i : Xi ^ 0}. For a sequence 
{x'^''^ : k G N}, a positive number p, and an integer k, we define /^(x'^'''^) := {i : \xf^^ \ > p}. 

Lemma 5: Let B be the (m + 1) x n matrix defined by ([10] ), let be the Log-Det function 
defined by (fT9l ), and let {x^'^'^ : A; G N} be the sequence generated by Algorithm [T] Assume 
that the matrix B^ has the RSP of order K with p > satisfying (1 + p)A' < n. If there exist 
yU > pen such that [/^(x^''"^)! > K for all sufficient large k, then there exists a A;" G N such that 
||x^'')||o < n and t{x^^~^^^) C t{x^''"^) for all k > k" . 

Proof: Set y'^^'> := V F^{\x^''^ \) . Since x^^^^^ is a solution to the optimization problem (fT6l) . 
then by Fermat's rule and the chain rule of subdifferential we have that 

G diag(|/(^))9|| • ||i(diag(y('=))x(^'+^)) + 

where b^^+^^ G 9ic(5a;^''+^^)- Hence, if "^^^ 7^ 0, we have that |/f^ = 

For i G I^{x'^^^), we have that > /i and yf^ = f^dxf'^l) < f^{p) for all G N, where 
= log(- + e). Furthermore, there exist a k' such that > for ? G I^{x'^^^) and k > k' 

due to Item (iii) in Theorem |2l Thus, we have for all k > k' 



je/^(a;('=)) 

where W* = nlime^o+ = - is a positive number dependent on p. 

Now, we are ready to prove ||a;(''^||o < n for all k > k". By Proposition HI we have that 
{a(x^''^))n when k — +00. Therefore, there exists an integer k" > k' such that |/^j(a;'^''')| > 
K and < a{x^^^))n < min{^ — e, p} for all k > k". Let io be the index such that Ix^-^^ ^| = 
{a{x''^"^))n- We will show that x-^^ = 0. If this statement is not true, that is, x-^ is not 
zero, then 

\{B^b('"^\,\ = f'MPl) = > PW*. (25) 
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However, since io is not in the set /^(x*-*^ "^) and satisfies the RSP, we have that 

ifI^,ix(k")) 

je/^Cx'C^")) 

which contradicts to (|25] ). Hence, we have that x-^^ '^^'^ = and \t{x^''"~^^^)\ < n. By replacing 
k" by k" + 1 and repeating this process we can obtain x\'^^ = for all £ G N. Therefore, 
||x||o < n for all k > k" . This process can be also applied to other components satisfying 
^(fc"+i) ^ Q_ rpj^^^ ^j^g^g g^.^^^ a A;" e N such that t{x^^^) C r(x(^'")) for all k > k" . ■ 

With Lemma [5l the next result shows that when the transpose of B satisfies the RSP there 
exists a cluster point of the sequence generated by Algorithm [U that is sparse and satisfies the 
consistency condition. 

Theorem 6: Let 5 be the (m + 1) x n matrix defined by (flOl) . let be the Log-Det function 
defined by (fT9l) . and let {x'-'"'^ : A; G N} be the sequence generated by Algorithm [T] Assume 
that the matrix B^ has the RSP of order K with p > satisfying (1 + p)K < n. Then 
there is a subsequence {x^^^'> : j G N} that converges to a [(1 + p)/i J -sparse solution, that is 
{a{x^^^'^))y(i+p)K+i\ ^ as J ^ +0O and e ^ 0. 

Proof: Suppose the theorem false. Then there exist /i*, for any < e* < there exist 
a e G (0,e*) and k' such that > /i* for all A; > k' . It implies that for all 

k>k' 

(^^'^) I > L(l + P)^ + IJ > (1 + P)K > K. (26) 

By Lemma [51 there exist a /c" > k' such that ||x*^'''||o < n and r(x*^*^+^'') C r(x^'^"^) for all 
/c > k". Let S = r(x*^'^"^). Thus x^^} = for all k > k" . Therefore, the optimization problem 
(fT6l) for updating x^''"^^^ can be reduced to the following one 



X 



G argmin{((VF,(|x('=)|))5,M) + ^((^s)^) : u G RI^I}. (27) 



If |r(x('^"')| > |/^,.(x('="))|, from ^ we have (1 +p)A' < \S\. Thus from Lemma|5]and 5j 
having RSP with the same parameters, there exist a k'" > k" such that r(x^'^^) < r(x^'^"^) for 
all A; > k'" . Therefore, by induction, there must exist a k such that for all /c > A; 

r(x('^)) = I^*{x^^^), t{x^) C r(x('^)). 

It means that for all A: > A; all the nonzero components of x*^''-' are bounded below by /i*. 
Therefore, for any k > k, the updating equation (fT6l) is reduced by (ITtT ) with 5 = I^*{x^''^). 
From Lemma |4] we get [(t(x*^'^')]|5| — which contradicts with Ixl^^jl > fx*. Therefore, we get 
this theorem. ■ 
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V. An Implementation of Algorithm [H 

In this section, we describe in detail an implementation of Algorithm [T] and show how to 
select the parameters of the associated algorithm. 

Solving problem (fT6l) is the main issue for Algorithm [1] A general model related to (fT6] ) is 

mm{\\Tx\\i + ip{Bx) : X eW}, (28) 

where F is a diagonal matrix with positive diagonal elements and (/? is in ro(M'^'^^). In particular, 
if we choose T = VF^(\x^''^\) and </? = lc, where x^'^^ is a vector in R", e is a positive number, C 
is given by (fTTI) . and is a function given by (fT3l) . then model (l28l) reduces to the optimization 
problem in Algorithm \T\ 

We solve model (l28l) by using recently developed first-order primal-dual algorithm (see, e.g., 
||6l , lfT3l , [|20ll ). To present this algorithm, we need two concepts in convex analysis, namely, the 
proximity operator and conjugate function. The proximity operator was introduced in [fTSl . For 
a function / G ro(M'^), the proximity operator of / with parameter A, denoted by pTox^^J, is a 
mapping from to itself, defined for a given point x G M*^ by 

prox^jr{x) := argmin ~ ^Wl + /(^) • ^ ^ • 

The conjugate of / G ro(M'^) is the function /* G ro(M'^) defined at ^ G M*^ by 

riz) :=sup{(x,^)-/(x):xGM"}. 

With these notation, the first-order primal-dual (PD) method for solving (|28T ) is summarized in 
Algorithm |2] (referred to as PD- subroutine). 

Theorem 7: Let i? be an (m + 1) x ri matrix defined by (fTOl ). let C be the set given by (fTTj) . 
let a and /3 be two positive numbers, and let L be a positive such that L > where ||i?|| 

is the largest singular value of B. If 

al3L < 1, 

then for any arbitrary initial vector n°) G M" x R" x R'"+^, the sequence {x*^ : A; G N} 

generated by Algorithm |2] converges to a solution of model (1281) . 

The proof of Theorem |7] follows immediately from Theorem 1 in or Theorem 3.5 in [|T3l . 
We skip its proof here. 

Both proximity operators proxQ,||.mop and prox^^, should be computed easily and efficiently 
in order to make the iterative scheme in Algorithm |2] numerically efficient. Indeed, the proximity 
operator prox„||.||j„p is given at 2; G R" as follows: for j = 1, 2, . . . , n 

(prox„||.|j^„r(^))j- = max{|2j| - a^j,0} ■ sign{zj), (29) 
where jj is the jth diagonal element of T. Using the well-known Moreau decomposition (see, 

e.g. m, m) 

prox^^* =1-13 prox 1 o I I , (30) 
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Algorithm 2 PD-subroutine (The first-order primal-dual algorithm for solving 



Input: the (m + 1) x n matrix B defined by (flOl) : two positive numbers a and (3 satisfying 
the relation a/3 < -pp-; the n x n diagonal matrix T with all diagonal elements positive; 
and the function G ro(]R"). 

Initialization: z = and an initial guess G x M™+i x 

repeat(? > 0) 

Step 1: Compute x*+^: 



Step 2: Compute u*"*"^: 



Step 3: Set i:=i + l. 

until a given stopping criteria is met and the corresponding vectors u\ and are 

denoted by m'^"'', m"'^'", and x""^^, respectively. 

Output: (u^"^ u"^"', x""""") = PD(a, /3, 5, T, ^, u^, x°) 



we can compute the proximity operator prox^^* via proxj_^ which depends on a particular form 
of the function Lp. As our purpose is to develop algorithms for the optimization problem in 
Algorithm \T\ we need to compute the proximity operator of which is given in the following. 

Lemma 8: If C is the set given by (fTTI) and /3 is a positive number, then for z E M."^'^^ we 
have that 

Prox^,, (2;) = {zi - {Zi)+, ...,Zm- {Zm)+, Z^+l " (31) 

where is s if s > and otherwise. 

Proof: We first give an explicit form for the proximity operator proxi . Note that ic = \lc 

for /3 > and l.c{z) = '•{i}(2m+i) + '■[0,00) (-^i)' for z E W"^^. Hence, we have that 

prox^,^ (z) = {{zi)+, {z2)+, {z^)+, 1), (32) 

where is s if s > and otherwise. Here we use the facts that prox^^^ (s) = (s)+ and 
prox^j^^(s) = 1 for any s G M. 

By the Moreau decomposition (l30l) . we have that prox^^* (z) = z— /3proxi^^ (^z). This together 
with equation Oil) yields dlB- ■ 

Next, we comment on the diagonal matrix T in model (|28|) . When the function in model (l28l) 
is chosen to be tc*, then the relation a(p = (p holds for any positive number a. Hence, by rescaling 
the diagonal matrix T in model (|28] ) with any positive number, that does not alter the solutions 
of model (l28l) . Therefore, we can assume that the largest diagonal entry of T is always equal to 
one. 
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In applications of Theorem |7] as in Algorithm |2l we should make the product of a and (3 as 
close to as possible. In our numerical simulations, we always set 

0.999 



a 



(33) 



In such the way, (3 is essentially the only parameter that needs to be determined. 

Prior to computing a for a given (3 by equation (l33l) . we need to know the norm of the 
matrix B. When min{m, n} is small, the norm of the matrix B can be computed directly. When 
min{m, n} is large, an upper bound of the norm of the matrix B is estimated in terms of the 
size of B as follows. 

Proposition 9: Let $ be an m x n matrix with i.i.d. standard Gaussian entries and y be an 
m-dimensional vector with its component being +1 or —1. We define an (m + 1) x n matrix B 
from $ and y via equation (flOl) . Then 



E{\\B\ 



l(v^ + 



Moreover, 



\B\\ < Vm + l{y/n + y/m + t) 



holds with probability at least 1 — 2e for all t > 0. 

Proof: By the structure of the matrix B in (flOl) . we know that 



\B\\ < 



diag(?/) 



l$l 



Therefore, we just need to compute the norms on the right-hand side of the above inequality. 
Denote by the m x m identity matrix and !,„ the vector with all its components being 1. 
Then 



y^ 



diag(y) y 



m 



m 



which is a special arrow-head matrix and has m + 1 as its largest eigenvalue (see [[1811 ). Hence, 

diag(?/) 

T 



y 



vrn+T. 



Furthermore, by using random matrix theory for the matrix $, we know that E{ || $ || } < ^Jn^^m 
and ||$|| < \fn^ y/m + t with probability at least 1 — 2e~*^/^ for all t > (see, e.g., This 
completes the proof of this proposition. ■ 
Let us compute the norm of B numerically for 100 randomly generated matrices $ and vectors 
y for the pair (m,n) with three different choices (500, 1000), (1000, 1000), and (1500, 1000), 
respectively. Corresponding to these choices, the mean values of ||i?|| are about 815, 1276, and 
1711 while the upper bounds of the expected values of by Proposition |9] are about 1208, 
2001, and 2726, respectively. We can see that the norm of B varies with its size and turns to be 
a big number when the value of min{m, n} is relatively large. As a consequence, the parameter 
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a or (3 must be very small relative to the other by equation (|33T ). Therefore, in what follows, 
the used matrix B in model (l28l) is considered to have been rescaled in the following way: 

B B 

or ^ ^ — (34) 



when the norm of B can be computed easily or not. 

The complete procedure for model ([12) and how the PD-subroutine is employed are summa 
rized in Algorithm |3] 



Algorithm 3 (Iterative scheme for model ([72]) ) 



Input: the {m + 1) x n matrix B formed by an m x n matrix $ and an m-dimensional 
vector y via (fTOl) : the set C given by (fTTI) : e G (0, 1), and r > 0; Omax and emin be two real 
numbers; the maximum iteration number k^ax- 

Initialization: normalizing B according to (|34|) : T being the nxn identity matrix; an initial 

guess (m°''^o,m'^"^o,x(°)) e X M™+^ x M"; and initial parameters (3 and a = 0.999//3. 

while < /cmax do 
Step 1: Compute 

= PD(a, /3, 5, r, Lc, ^''''^N x^'^)) 

Step 2: Update F as the scaled matrix diag(VFe(x^'^+^^)) such that the largest diagonal 
element of T is one. 

Step 3: If a < Omax, update a 2a, /3 ^ /3/2; if e > emin, update e ^ re; 

Step 4: Update k ^ k + 1. 
end while 
Output: x^'"™''"^ 



VI. Numerical Simulations 

In this section, we demonstrate the performance of Algorithm |3] for 1-bit compressive sampling 
reconstruction in terms of accuracy and consistency and compare it with the BIHT algorithm. 

Through this section, all random m x n matrices $ and length-n, s-sparse vectors x are 
generated based on the following assumption: entries of $ and x on its support are i.i.d. Gaussian 
random variables with zero mean and unit variances. The locations of the nonzero entries (i.e., 
the support) of x are randomly permuted. We then generate the 1-bit observation vector y by 
equation We obtain reconstruction of x* from y by using the BIHT and Algorithm [3l The 
quality of the reconstructed x* is measured in terms of the signal-to-noise ratio (SNR) in dB 

SNR(x, X*) = 20 log 
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The accuracy of the BIHT and Algorithm [3] is measured by the average of SNR values over 100 
trials unless otherwise noted. For all figures in this section, results by the BIHT and Algorithm |3] 
with the Mangasarian function (flTl) and the Log-Det function (fT9l) are marked by the symbols 
"V", "o", and 'V, respectively. 

A. Ejfects of using inaccurate sparsity on the BIHT 

The BIHT requires the availability of the sparsity of the underlying signals. This requirement 
is, however, not known in practical applications. In this subsection, we demonstrate through 
numerical experiments that the mismatched sparsity for a signal will degenerate the performance 
of the BIHT. 

To this end, we fix n = 1000 and s = 10 and consider two cases of m being 500 and 1000. 
For each case, we vary the sparsity input for the BIHT from 8 to 12 in which 10 is the only 
right choice. Therefore, there are total ten configurations . For each configuration, we record the 
SNR values and the numbers of sign constraints not being satisfied of the reconstructed signals 
by the BIHT. 

Figure |2] depicts the SNR values of the experiments. The plots in the left column of Figure |2] 
are for the case m = 500 while the plots in the right column are for the case m = 1000. The 
marks in each plot represent the pairs of the SNR values with a mismatched sparsity input (i.e., 
s = 8, s = 9, s = 11, or s = 12 corresponding to the row 1, 2, 3, or 4) and with the correct 
sparsity input (i.e., s = 10). A mark below the red line indicates that the BIHT with the correct 
sparsity input works better than the one with an incorrect sparsity input. A mark that is far away 
from the red line indicates the BIHT with the correct sparsity input works much better than the 
one with an incorrect sparsity input or vice versa. Except the second plot in the left column, 
we can see that the BIHT with the correct sparsity input performs better than the one with an 
inaccurate sparsity input. In particular, when an underestimated sparsity input to the BIHT is 
used, the performance of the BIHT will be significantly reduced (see the plots in the first two 
columns of Figure l2]l. When an overestimated sparsity input to the BIHT is used, majority marks 
are under the red lines and are relatively closer to the red lines than those from the BIHT with 
underestimated sparsity input. We further report that the average SNR values for the sparsity 
input s = 8, 9, 10, 11, and 12 for m = 500 are 21.89dB, 24.18dB, 23.25dB, 22.10dB, and 
21.00dB, respectively. Similarly, for m = 1000, the average SNR values for the sparsity input 
s = 8, 9, 10, 11, and 12 are 19.77dB, 26.37dB, 34.74dB, 31.12dB, and 29.46dB, respectively. 

Figure |3] (a) and (b) illustrate the histograms of the numbers of unsatisfied consistency 
conditions over 200 trials for m = 500 and 1000, respectively. We can see from Figure |3] 
(a) that the use of an underestimated sparsity constraint (s = 8 or 9) will tend to yield, on 
average, a solution with a large amount of sign constraints unsatisfied, in other words, under the 
current setting the solution to model <U} via the BIHT does not satisfy equation Q. As expected, 
when an overestimated sparsity constraint (s = 11 or 12) is used, the sign constraints are usually 
satisfied. 
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In summary, we conclude that a proper chosen sparsity constraint is critical for the success 
of the BIHT. 

B. Plan-Vershynin's model for 1-bit reconstruction 

Both our model ^ and Plan-Vershynin's model ([8]) use the same constraint conditions. Their 
objective functions are different. Our model uses the (.Q-noxm while Plan-Vershynin's model uses 
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Fig. 3. The histograms of the numbers of unsatisfied consistency conditions over 200 trials with (a) (m, n) 
(b) {m,n) = (1000, 1000). 



(500, 1000) and 



the £i-norm. As suggested in [fTTl . linear programming can be applied for the Plan-Vershynin 
model. We report here some numerical results for this model. 

In our simulations, we fix n = 1000, m = 1000, and s = 10. All simulations were performed 
100 trials. Figure |4] illustrates the sparsity of the reconstructions of all trials which are clearly 
greater than 10 (indicated by the solid red line in the figure). The average sparsity of the 
reconstructions over 100 trials is 23.42. Recall that the average SNR values of all reconstructions 
by the BIHT is 34.74dB. 




Fig. 4. Results for Plan-Vershynin's model using Linear programming over 100 trials. 



C. Performance of Algorithm\3\ 

Prior to applying Algorithm |3] for 1-bit compressive sampling problem, parameters fcmax, t, 
ttmax, emin, and e in Algorithm |3] need to be determined. Under the aforementioned setting for 
the random matrix $ and sparse signal x, we fix fcmax = 17, r = |, amax = 8000, emin = 10"^. 
For the functions defined by (flTl) and (fT9l) . we set the pair of initial parameters {a, e) as 
(500,0.25) and (250,0.125), respectively. The iterative process in the PD-subroutine is forced 
to stop if the corresponding number of iteration exceeds 300. These parameters are used in all 
simulations performed by Algorithm |3] in the rest of this section. 
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To evaluate the performance of Algorithm |3] in terms of SNR values at various scenarios, we 
consider three configurations for the size of the random matrix $ and the sparsity of the vector 
X. In the first configuration, we fix n = 1000 and s = 10 and vary m such that the ratio m/n 
is between 0.1 and 2. In the second configuration, we fix m = 1000 and n = 1000 and vary the 
sparsity of x from 1 to 20. In the third configuration, we fix m = 1000 and s = 10 and vary n 
from 500 to 1400. 

For every cases in each configuration, we compare the accuracy of Algorithm |3] with the BIHT 
by computing the average of SNR values over 100 trials. For the given parameters and stopping 
criteria adopted by Algorithm |3l the estimate x'^'^™'"'^ may not satisfy the consistency condition ([3]), 
that is the signs of measurements of the estimate x^^™'''') are not completely consistent with that 
of the original measurements. Thus, for a fair comparison, we only compute the average of 
SNR values for those trials that both reconstructions from the BIHT and Algorithm [3] satisfy the 
consistency condition Q and we say the corresponding trials are valid. 

For the first configuration, the SNR values in decibels of the average reconstruction errors 
by both the BIHT and Algorithm [3] are depicted in Figure |5l The plots demonstrate that our 
proposed algorithm performs as equally good as the BIHT, in particular, when m/n is greater 
than 1, even thought our algorithm does not require to know the exact sparsity of the original 
signal. We can see that Algorithm |3] with the Log-Det function (fT9l) (Figure |5lb)) performs 
slightly better than with the Mangasarian function (fTTT ) (Figure 13 a)) . 













BIHT 




-e-Our Algorithm 











■ * 


BIHT 




^^Our Algorithm 



(a) Mangasarian (b) Log-Det 

Fig. 5. Average SNR values vs. m/n for fixed n = 1000 and s — 10. 



Detailed descriptions for valid trials for m = 200, 800, 1400, and 2000 are displayed in 
the rows (from top to bottom) of Figure |6l respectively. The horizontal axis of each plot 
represents the sparsity of the reconstructed signals by Algorithm |3] while the vertical axis 
represents the difference of the SNR values of the reconstructions between Algorithm |3] and 
the BIHT. Therefore, the marks ("o" and "^^r") above the dashed horizontal lines indicate that 
Algorithm |3] performs better than the BIHT for the corresponding trials. Since all ideal signals 
in our simulations are 10-sparse, the marks whose horizontal axis are bigger than, exactly equal 
to, or smaller than 10 imply that the £o-norm of the reconstructions by Algorithm |3]are bigger 
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Fig. 6. The difference of SNR values of estimates by Algorithm [3] and the BIHT vs. sparsity of reconstructed estimates 
by Algorithm [I] We fix n = 1000 and s = 10. Row 1 to Row 4 are corresponding to m being 200, 800, 1400, and 2000, 
respectively. 



than, exactly equal to, or smaller than 10, respectively. Thus, the £o-norm of a reconstruction 
over 10 indicates that the reconstruction is not a global minimizer of model the £o-norm 
of a reconstruction being 10 indicates that the sparsity of the reconstruction is consistent with 
the one of the original test signal, the £o-norm of a reconstruction below 10 indicates that the 
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reconstruction is potentially a global minimizer of model <^ and the original test signal is not a 
solution to model We can conclude from Figure |6] that (i) the reconstructions by Algorithm |3] 
with sparsity higher (res. lower) than 10 usually have lower (res. higher) SNR values than that 
by the BIHT; (ii) Increasing m (number of measurements) tends to reduce the sparsity of the 
reconstructions. For example, average sparsity of the reconstructions for m = 200, 800, 1400, 
and 2000 are, respectively, 11.74, 10.26, 10, and 10.06 for Algorithm [3] with the Mangasarian 
function, and 11.53, 10.05, 9.88, 9.88 for Algorithm |3] with the Log-Det function. 

For the second configuration, the SNR values in decibels of the average reconstruction errors 
by both the BIHT and Algorithm [3] are compared in Figure |7] for varying sparsity of original 
signals. The plots demonstrate that our proposed algorithm performs better than the BIHT for 
sparsity s being 2 and 6 to 10. We emphasize again that unlike the BIHT the exact sparsity 
of the original signal is not required in advance by Algorithm [31 We remark that when s = 1 
both the BIHT and Algorithm [3] find an exact solution to model This phenomenon was also 
reported in [[T2l . Detailed descriptions for valid trials for s = 2, 8, 14, and 20 are displayed in 
the rows (from top to bottom) of Figure |8l respectively. The marks in each plot of Figure |8] 
have the same meaning as that in Figure |6l For fixed m = 1000 and n = 1000 we can draw 
conclusions from Figure [8] that (i) Algorithm |3] tends to produce an estimate whose sparsity is 
consistent with the ideal sparse signal; (ii) Algorithm |3] can give an estimate whose sparsity is 
smaller than that of the ideal sparse signal, in particular, when the sparsity of an original signal 
is relative large. 
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Fig. 7. Average SNR values vs. sparsity of original signals for fixed n = 1000 and m = 1000. 



For the third configuration, the SNR values in decibels of the average reconstruction errors by 
both the BIHT and Algorithm [3] are compared in Figure |9]for fixed m = 1000 and s = 10 and 
varying dimensions of original signals. The plots in Figure [9] show that the average SNR values 
for reconstructions by Algorithm |3] are lower than that by the BIHT in most cases. This is due 
to the fact that the BIHT explores an unattainable additional information on the sparsity of the 
original signal. Another reason which we can see from Figure [10] is that reconstructions with 
their sparsity larger than 10 by Algorithm [3] usually have lower SNR values than by the BIHT. 
The marks in each plot of Figure [JO] have the same meaning as that in Figures [6] and [8] For fixed 
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(a) Mangasarian (b) Log-Det 

Fig. 8. The difference of SNR values of estimates by Algoritlim [3] and tlie BIHT vs. sparsity of reconstructed estimates by 
Algoritiim |3] We Hx n — 1000 and m = 1000. Row 1 to Row 4 are con'esponding to s being 2, 8, 14, and 20, respectively. 



m = 1000 and s = 10 we can draw conclusions from Figure [TO] that (i) Algorithm |3]can give 
an estimate whose sparsity is smaller than that of the ideal sparse signal and (ii) Algorithm |3] 
with the Log-Det function works better than that with the Mangasarian function. 
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(a) Mangasarian (b) Log-Det 



Fig. 9. Average SNR values of estimates vs. tlie signal size n for fixed m = 1000 and s = 10. 

VII. Summary and Conclusion 

In this paper we proposed a new model and algorithm for 1-bit compressive sensing. Unlike 
the state-of-the-art BIHT method, our model does not need to know the sparsity of the signal of 
interest. We demonstrated the performance of our proposed algorithm for reconstruction from 
1-bit measurements. 

It would be of interest to study the convergence of Algorithm [3] with the Mangasarian function 
in the future. It would be highly needed to adaptively update all parameters in Algorithm |3] so 
that consistent reconstruction can be always achieved with improved accuracy of the solution. 
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